Quotes as Data Extracting Political Statements from Dutch Newspapers by applying Transformation Rules to Syntax Graphs
نویسنده
چکیده
To understand the relation between media and politics, it is necessary to study the content of politicians’ statements in the news. This paper presents a method to automatically extract such statements by applying graph transformation rules to the syntactic structure of Dutch newspaper sentences. It also shows how politicians can be identified using a dictionary approach and anaphora resolution. The method is validated using manual verification, yielding good precision (86%) and recall (82%) for the extraction of quotes and decent recall (73%) for the identification of politicians. This shows that the method presented here performs sufficiently for investigating political statements in the news on a large scale.
منابع مشابه
Reverse Engineering of Network Software Binary Codes for Identification of Syntax and Semantics of Protocol Messages
Reverse engineering of network applications especially from the security point of view is of high importance and interest. Many network applications use proprietary protocols which specifications are not publicly available. Reverse engineering of such applications could provide us with vital information to understand their embedded unknown protocols. This could facilitate many tasks including d...
متن کاملApplying Crawford and Ostrom’s Grammar
In 1995, Crawford and Ostrom proposed a grammatical syntax for examining institutional statements (i.e., rules, norms, and strategies) as part of the institutional analysis and development framework. This article constitutes the first attempt at applying the grammatical syntax to code institutional statements using two pieces of U.S. legislation. The authors illustrate how the grammatical synta...
متن کاملPolitical leaders and the media. Can we measure political leadership images in newspapers using computer-assisted content analysis?
Despite the large amount of research into both media coverage of politics as well as political leadership, surprisingly little research has been devoted to the ways political leaders are discussed in the media. This paper studies whether computer-aided content analysis can be applied in examining political leadership images in Dutch newspaper articles. It, firstly, provides a conceptualization ...
متن کاملIntroducing a method for extracting features from facial images based on applying transformations to features obtained from convolutional neural networks
In pattern recognition, features are denoting some measurable characteristics of an observed phenomenon and feature extraction is the procedure of measuring these characteristics. A set of features can be expressed by a feature vector which is used as the input data of a system. An efficient feature extraction method can improve the performance of a machine learning system such as face recognit...
متن کاملQuantifying "Pillarization": Extracting Political History from Large Databases of Digitized Media Collections
We analyzed long-term dynamic developments in newspaper content in connection with the process of pillarization (the segmentation of Dutch society and politics along religious/ideological cleavages) over the period 1918– 1967. One of the main characteristics of the historical debate on this phenomenon is an alleged close connection of political and media organizations on personnel, organization...
متن کامل